Categorical spectral analysis of periodicity in human and viral genomes

نویسندگان

  • Elizabeth D. Howe
  • Jun S. Song
چکیده

Periodicity in nucleotide sequences arises from regular repeating patterns which may reflect important structure and function. Although a three-base periodicity in coding regions has been known for some time and has provided the basis for powerful gene prediction algorithms, its origins are still not fully understood. Here, we show that, contrary to common belief, amino acid (AA) bias and codon usage bias are insufficient to create base-3 periodicity. This article applies the rigorous method of spectral envelope to systematically characterize the contributions of codon bias, AA bias and protein structural motifs to the three-base periodicity of coding sequences. The method is also used to classify CpG islands in the human genome. In addition, we show how spectral envelope can be used to trace the evolution of viral genomes and monitor global sequence changes without having to align to previously known genomes. This approach also detects reassortment events, such as those that led to the 2009 pandemic H1N1 virus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Categorical spectral analysis of periodicity in nucleosomal DNA

DNA helical twist imposes geometric constraints on the location of histone-DNA interaction sites along nucleosomal DNA. Certain 10.5-bp periodic nucleotides in phase with these geometric constraints have been suggested to facilitate nucleosome positioning. However, the extent of nucleotide periodicity in nucleosomal DNA and its significance in directing nucleosome positioning still remain uncle...

متن کامل

HeteroGenome: database of genome periodicity

We present the first release of the HeteroGenome database collecting latent periodicity regions in genomes. Tandem repeats and highly divergent tandem repeats along with the regions of a new type of periodicity, known as profile periodicity, have been collected for the genomes of Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans and Drosophila melanogaster. We obtained data...

متن کامل

Induction of Nucleic Acid Damage in Viral Genomes using Riboflavin in Combination with UV Light

Background and Aims: Despite the screening of blood donors, blood transfusion represents an ideal port of entry for blood-borne infection. Blood-borne pathogen transmission has been a concern since the earliest days of transfusion. The blood product of platelet (PLT) concentrates is still faced with the risk of bacterial and viral contaminations. Pathogen inactivation technologies offer a proac...

متن کامل

Are Categorical Periodograms and Indicator Sequences of Genomes Spectrally Equivalent?

This paper reports a novel symbol-to-signal mapping for DNA sequences, based on the concept of categorical periodograms. A categorical periodogram is a numeric sequence with the n-th element of the sequence indicating the number of occurrences of cycles with period n in it. The period of the cycle is defined as the number of intervening events plus one. Spectral analysis studies have been condu...

متن کامل

Induction of Nucleic Acid Damage in Viral Genomes Using Riboflavin in Combination with UV Light

Background and Aims: Despite the screening of blood donors, blood transfusion represents an ideal port of entry for blood-borne infection. Blood-borne pathogen transmission has been a concern since the earliest days of transfusion. The blood product of platelet (PLT) concentrates is still faced with the risk of bacterial and viral contaminations. Pathogen inactivation technologies offer a proac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 41  شماره 

صفحات  -

تاریخ انتشار 2013